Adapting k-means for graph clustering

نویسندگان

چکیده

Abstract We propose two new algorithms for clustering graphs and networks. The first, called K?algorithm , is derived directly from the k -means algorithm. It applies similar iterative local optimization but without need to calculate means. inherits properties of in terms both good capability tendency get stuck at a optimum. second algorithm, M-algorithm gradually improves on results K -algorithm find potentially better optima. repeatedly merges splits random clusters tunes with -algorithm. Both are general sense that they can be used different cost functions. consider conductance function also introduce functions, inverse internal weight mean . According our experiments, M outperforms eight other state-of-the-art methods. perform case study by analyzing disease co-occurrence network, which demonstrate usefulness an important real-life application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Graph based k-means clustering

An original approach to cluster multi-component data sets is proposed that includes an estimation of the number of clusters. Using Prim’s algorithm to construct a minimal spanning tree (MST) we show that, under the assumption that the vertices are approximately distributed according to a spatial homogeneous Poisson process, the number of clusters can be accurately estimated by thresholding the ...

متن کامل

Adapting k-means for Clustering in Big Data

Big data if used properly can bring huge benefits to the business, science and humanity. The various properties of big data like volume, velocity, variety, variation and veracity render the existing techniques of data analysis ineffective. Big data analysis needs fusion of techniques for data mining with those of machine learning. The k-means algorithm is one such algorithm which has presence i...

متن کامل

K-Means Graph Database Clustering and Matching for Fingerprint Recognition

The graph can contain huge amount of data. It is heavily used for pattern recognition and matching tasks like symbol recognition, information retrieval, data mining etc. In all these applications, the objects or underlying data are represented in the form of graph and graph based matching is performed. The conventional algorithms of graph matching have higher complexity. This is because the mos...

متن کامل

Persistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm

Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...

متن کامل

Balanced K-Means for Clustering

We present a k-means-based clustering algorithm, which optimizes mean square error, for given cluster sizes. A straightforward application is balanced clustering, where the sizes of each cluster are equal. In k-means assignment phase, the algorithm solves the assignment problem by Hungarian algorithm. This is a novel approach, and makes the assignment phase time complexity O(n), which is faster...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Knowledge and Information Systems

سال: 2021

ISSN: ['0219-3116', '0219-1377']

DOI: https://doi.org/10.1007/s10115-021-01623-y